|
The parallelization contract or PACT programming model is a generalization of the MapReduce programming model and uses second order functions to perform concurrent computations on large (Petabytes) data sets in parallel. == Overview == Similar to MapReduce, arbitrary user code is handed and executed by PACTs. However, PACT generalizes a couple of MapReduce's concepts: * Second-order Functions: PACT provides more second-order functions. Currently, five second-order functions called Input Contracts are supported. This set might be extended in the future. * Program structure: PACT allows the composition of arbitrary acyclic data flow graphs. In contract, MapReduce programs have a static structure (Map -> Reduce). * Data Model: PACT's data model are records of arbitrary many fields of arbitrary types. MapReduce's KeyValue-Pairs can be considered as records with two fields. Apache Flink, an open-source parallel data processing platform has implemented PACTs. Flink allows users to specify user functions with annotations. 抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「Parallelization contract」の詳細全文を読む スポンサード リンク
|